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SEQUENCE LISTING 

SEQ ID NO: 1 

SEQUENCE LENGTH: 4008 base pairs 
SEQUENCE TYPE: nucleic acid 
5 STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL: No 
ANTI-SENSE: No 
10 ORIGINAL SOURCE: 

ORGANISM: rat (Rattus norvegicu 

TISSUE TYPE: brain 
FEATURE: 



FEATURE KEY: 5' UTR 



LOCATION: 1..75 



IDENTIFICATION METHOD: E 



FEATURE KEY: CDS 



LOCATION: 76. .24 



IDENTIFICATION METHOD: E 



3' UTR 



LOCATION/2407.. 3977 
IDENTIFICATION METHOD: E 
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FEATURE KEY: polyA signal 
LOCATION: 3978. .4008 
IDENTIFICATION METHOD: E 
SEQUENCE DESCRIPTION: 
5 GCCGAGGCCC GCGCAGTAGC GGTACTAAGT AGAGGCTGCT GGACGCGCCC CACtfCGGCAC 



60 



CAGGCGGAGC CAGAGATGCT TGCCAGGGCC GAGCGGCCCC GCCCGGGCCC CCGGCCGCCT 120 

CCGGTCTTTC CCTTCCCGCC GCCGCTGTCG CTGCTGCTGC TGCTGGCGAT /ACTAAGCGCC 180 

CCGGTGTGCG GCCGCGTCCC CCGCTCAGTG CCCAGAACCT CGCTGCCCM CTCCGAGGCT 240 

Q GACTCCTATC TCACCCGGTT TGCAGCGTCT CATACGTACA ATTACTCTGC TCTCCTTGTG 300 

(y 10 GATCCTGCCT CCCACACACT TTACGTCGGT GCACGGGATA GCATCTtCGC TTTAACCCTC 360 

m / 

=P CCCTTCTCTG GGGAAAGACC CCGAAGGATC GACTGGATGG TACCTGAGAC TCACAGACAG 420 



AACTGCAGGA AGAAAGGCAA GAAAGAGGAC GAATGTCACA ATTT/FATCCA GATTCTCGCC 480 

ATTGTCAATG CCTCTCACCT CCTCACGTGC GGCACCTTCG CTTfrTGATCC GAAGTGCGGG 540 

GTTATTGATG TGTCCAGTTT CCAGCAGGTT GAAAGACTTG AGftGCGGCCG GGGGAAATGT 600 

15 CCTTTTGAGC CAGCTCAACG GTCAGCAGCT GTAATGGCTG QGGGCGTCCT CTACACCGCC 660 

ACTGTGAAGA ACTTCCTGGG GACTGAGCCC ATCATCTCCC GAGCTGTGGG TCGAGCTGAG 720 

GACTGGATTC GAACAGAGAC CTTGTCATCC TGGCTTAATG/ CTCCAGCCTT TGTCGCAGCT 780 

ATGGTCCTGA GCCCAGCTGA GTGGGGGGAT GAAGATGGAtt ACGATGAAAT CTTTTTTTTC 840 

TTCACGGAGA CCTCCCGAGT GTTGGACTCC TATGAGCGuA TCAAGGTCCC AAGAGTGGCC 900 

20 CGAGTGTGTG CGGGGGACCT TGGGGGCAGG MGACCCTfrC AGCAGAGATG GACGACGTTT 960 

CTGAAGGCTG ACCTGCTGTG CCCAGGGCCC GAGCATGGCC GGGCCTCCGG GGTTCTGCAG 1020 

GCTATGGCAG AGCTTCGGCC TCAGCCTGGA GCGGGAACCC CCATCTTTTA TGGGATCTTT 1080 

TCCTCCCAGT GGGAAGGAGC TGCCATCTCT GCTGTGTGTG CCTTCCGACC CCAAGACATC 1140 

CGGGCAGTGC TGAATGGTCC CTTTAGAGAG CTAAAAOATG ACTGCAACAG GGGACTGCCT 1200 

25 GTCATGGACA ACGAGGTGCC CCAGOCCAGA CCTGGA(VgT^^TCGPCAA CAACATGAAG 1260 
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CTCCAGCAGT TTGGATCCTC ACTCTCCCTG CCAGACCGCG TGCTCACCTT TATCAGAGAC 1,320 
CACCCTCTCA TGGACAGGCC CGTGTTCCCG GCTGACGGCC GCCCCCTGCT GGTCACTACA / 1380 

GATACAGCCT ATCTCAGAGT CGTGGCCCAC AGGGTGACCA GCCTCTCAGG GAAAGAATAT 1440 

GACGTGCTCT ACCTGGGGAC AGAGGATGGA CACCTCCACC GGGCTGTGCG CATTG0AGCT 1500 

5 CAGCTCAGTG TCTTGGAGGA TCTGGCCTTG TTCCCAGAAC CACAGCCGGT TGAuAGCATG 1560 

AAATTGTACC ACGATTGGCT CCTGGTGGGC TCCCATACTG AGGTGACACA/AGTGAACACC 1620 

AGCAACTGTG GCCGTCTCCA GAGCTGCTCG GAGTGTATCC TGGCCCAQGA CCCCGTGTGC 1680 

GCCTGGAGCT TCCGGCTTGA TGCTTGTGTG GCCCACGCCG GCGAGCACCG CGGGATGGTT 1740 

m CAAGATATAG AGTCAGCGGA TGTCTCTTCT TTGTGTCCAA AA6AACCTGG AGAACATCCC 1800 

*0 / 

fy 10 GTAGTGTTTG AAGTTCCGGT GGCTACTGTG GGCCACGTGG yTCCTGCCATG TTCCCCCAGT 1860 

08 

=F TCTGCCTGGG CATCCTGTGT GTGGCACCAG CCCAGTGG/iG TGACTGCGCT CACTCCCCGG 1920 

ro AGGGATGGAC TAGAGGTGGT GGTGACCCCA GGGGCGATGG GGGCTTATGC TTGCGAGTGT 1980 



s 

B 
fin 
P 
<J9 



CAGGAGGGTG GAGCCGCCCG CGTGGTGGCT GCT/fATAGCT TGGTGTGGGG CAGCCAGCGG 2040 

GGACCCTCAA ACCGGGCCCA CACCGTTGTG GGGGCTGGAT TGGTTGGCTT TCTCCTGGGT 2100 

15 GTTCTTGCAG CATCCCTCAC TCTCCTCCT(5 ATTGGTCGCC GTCAGCAGCG TCGGCGACAG 2160 

AGGGAGCTTC TAGCTAGAGA CAAGGTGj^GC TTAGATCTGG GGGCTCCACC TTCTGGGACC 2220 

ACAAGCTATA GTCAGGACCC TCCCTOTCCT TCGCCTGAAG ATGAACGGCT GCCCCTGGCC 2280 

CTGGGTAAGC GGGGCAGTGG TTT'DGGTGGC TTCCCTCCAC CCTTCCTGCT GGATTCTTGC 2340 

CCAAGCCCAG CCCACATCCG GCTCACTGGG GCGCCTCTAG CCACGTGTGA TGAGACCTCC 2400 

20 ATCTAAAGCC GGGGAAAATG ACTGCCAGCC ATGAGCAGTC TCTGGAACTA GTGGCTACCA 2460 

AGACCATGAT CATGGCTGCT /CCTTTCTCTT GGAGTCTGTG TGTTCACACA TTAGTGTCTG 2520 

TCCTCTGGAC CTGGACCTGCf CCTTTGCCCA GATTCCTGAT TCTCATGAGA GATCAACCCT 2580 

GTAACCTTCT GCGATGG(XT CTTGTCTTGG GCCCATCAGC TTGTGGGGTG GAGTAAGGAC 2640 

ATAGGCCCCG GAAAGGdAAT CAGTGTGGAG GTAGTTGGGG CGTGTGTGCC CTGCGTCCTT 2700 



25 GTGGTGGCTG TATGA/ITTCCXCAGTCTGCTG ACTCTGGGGA GCGCATGATC CCCTGACTGC 2760 



in 
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CTTGAGATCT CTCCCAACTC AGTTTCCCCT TGCTCTGGAA GAGTGTGTGT CTATACACTG ^820 
GTGTGCCTAG AAGGCCTGTC CATGTGTGCA TGGACGACAG GGCCGGTGCC TCCXJTGCTTt/ 2880 

TGGGGAGTCG GAGAGAAAGG TTGGAATGGG GGACAACTTA ACCCTCGGTA GCCAGTGAGG 2940 

GAAACCACAT GCCCGTCCCC ATCACCCCAC AGCGCTTCTT TAACTTTGAG CAAAG^TCCC 3000 

5 AAAGTGACCT TCTGGGTGGG AAGGGCAGCA GGACATGTGG CCCCCGTCCT TCTCCTTGTC 3060 

TTTCCCTTCT GGCTGCCAAC CACTGCCGTG CCACCGCTGC GCTTTCCCTG yGCTGGAGTGG 3120 

AGGCTGAGTC CTCTGTCCTT GGTTTCCATT TAAAATGAAC TTCACAAOAT TCTAAATATT 3180 

GGGGGATGAC AAATGACTTT TTTCCCCAGA AAAGTGTGTA GGAAATACAA GCAGGTTAAA 3240 

GAAGATTTGC CTCAGTGACT TTCACCCTTG CCCTAAAGCA GGAGTCCCTC AGCTAGCGTC 3300 

10 TGTGGACTCC CTGAAATTGT ATGCGTCTGT GGACTCCCTG AMTTGTATG CAAAGTGTCT 3360 

GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGt/gTTTGCGTGC ATGTGTGCAT 3420 

GTGTGTTTGA TGGCTTTCAT CAGATTCTCA AGGCCTTAAT GAGGTTAAAG GACCACGGCC 3480 

TATAGTCACC ACAGTTGGGC CACATGGAGG AGGTGTTGCT CTCTGAGGCA GTTCCTCCCT 3540 

GGCCTGCCTG AGGCCAGCCC CTGGACACAT TGCTGGTGGA GACCCCACAT CTCTCCAGAA 3600 

15 CTTGGAAGCT AGGCTCTGCG CGTGCTTGAA GGCACCACCA TCTCCCTTCT TGCTTCATTC 3660 

TCCTGTGTGC TCTGCCTCTG CTCAGTCCTG CT^TIGGCCT GTGAATGTGC CTCGCCCGTC 3720 

CCTGGTGGGG GACCTCAAAC CCCAGTGCTG ATGCTACCCT TTCCAGTGGG AGTTTCTGTT 3780 

CTGCTTTCCT TGACAGCAGC CTGTGAACTA CTCACGAGTC CCCTTGGTTT GGAGTTCCCG 3840 

GTGGCTTTGA GTAGGATCTT TGGCGTGGCA' TCTAACCTAG CAGCATTGAT CGTTCATTGT 3900 

20 AAAGTGGGGA TATACCTACC TCAGGGTT6C TGCAAGGATC AAACGAGGAA ACGTATAAAT 3960 

AAAGCATTAC CCACAGCAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 4008 



SEQ ID NO: 2 / 
SEQUENCE LENGTH:/2331 base pairs 
25 SEQUENCE TYPE.: nJcleic acid 



4U 



STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL: No 
5 ANTI-SENSE: No 

ORIGINAL SOURCE: 

ORGANISM: rat (Rattus norvegicus) 
TISSUE TYPE: brain 
FEATURE: 

10 FEATURE KEY: CDS 
LOCATION: 1..2331 
IDENTIFICATION METHOD: E 
SEQUENCE DESCRIPTION: 
ATGCTTGCCA GGGCCGAGCG GCCCCGCCCG GGCOCCCGGC CGCCTCCGGT CTTTCCCTTC 60 

15 CCGCCGCCGC TGTCGCTGCT GCTGCTGCTG GCGATACTAA GCGCCCCGGT GTGCGGCCGC 120 
GTCCCCCGCT CAGTGCCCAG AACCTCGCTG CCCATCTCCG AGGCTGACTC CTATCTCACC 180 
CGGTTTGCAG CGTCTCATAC GTACAATTAe TCTGCTCTCC TTGTGGATCC TGCCTCCCAC 240 
ACACTTTACG TCGGTGCACG GGATAGCATC TTCGCTTTAA CCCTCCCCTT CTCTGGGGAA 300 
AGACCCCGAA GGATCGACTG GATGGTACCT GAGACTCACA GACAGAACTG CAGGAAGAAA 360 

20 GGCAAGAAAG AGGACGAATG TCACMTTTT ATCCAGATTC TCGCCATTGT CAATGCCTCT 420 
CACCTCCTCA CGTGCGGCAC CTTCGCTTTT GATCCGAAGT GCGGGGTTAT TGATGTGTCC 480 
AGTTTCCAGC AGGTTGAAAG AOTTGAGAGC GGCCGGGGGA AATGTCCTTT TGAGCCAGCT 540 
CAACGGTCAG CAGCTGTAAT GGCTGGGGGC GTCCTCTACA CCGCCACTGT GAAGAACTTC 600 
CTGGGGACTG AGCCCATCAT uTCCCGAGCT GTGGGTCGAG CTGAGGACTG GATTCGAACA 660 

25 GAGACCTTGT CATCCTGGCT TiAATGCTCCA GCCTTTGTCG CAGCTATGGT CCTGAGCCCA 720 
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GCTGAGTGGG GGGATGAAGA TGGAGACGAT GAAATCTTTT TTTTCTTCAC GGAGACCTCC / 780 



CGAGTGTTGG ACTCCTATGA GCGCATCAAG GTCCCAAGAG TGGCCCGAGT GTGTGCG 



840 



GACCTTGGGG GCAGGAAGAC CCTTCAGCAG AGATGGACGA CGTTTCTGAA GGCTGACCTG 900 

CTGTGCCCAG GGCCCGAGCA TGGCCGGGCC TCCGGGGTTC TGCAGGCTAT GGCA^AGCTT 960 

5 CGGCCTCAGC CTGGAGCGGG AACCCCCATC TTTTATGGGA TCTTTTCCTC CCAGTGGGAA 1020 

' GGAGCTGCCA TCTCTGCTGT GTGTGCCTTC CGACCCCAAG ACATCCGGGC AGTGCTGAAT 1080 

GGTCCCTTTA GAGAGCTAAA ACATGACTGC AACAGGGGAC TGCCrGTCjOT GGACAACGAG 1140 

GTGCCCCAGC CCAGACCTGG AGAGTGCATC GCCAACAACA TGAAGCTCCA GCAGTTTGGA 1200 

TCCTCACTCT CCCTGCCAGA CCGCGTGCTC ACCTTTATCA GAGAcAcCC TCTCATGGAC 1260 

jfi 10 AGGCCCGTGT TCCCGGCTGA CGGCCGCCCC CTGCTGGTCA CTAdAGATAC AGCCTATCTC 1320 

^ AGAGTCGTGG CCCACAGGGT GACCAGCCTC TCAGGGAAAG AMATGACGT GCTCTACCTG 1380 

^ GGGACAGAGG ATGGACACCT CCACCGGGCT GTGCGCATTG GAGCTCAGCT CAGTGTCTTG 1440 

r GAGGATCTGG CCTTGTTCCC AGAACCACAG CCGGTTGAGA/GCATGAAATT GTACCACGAT 1500 

ffl TGGCTCCTGG TGGGCTCCX:A TACTGAGGTG ACACAAGTGA ACACCAGCAA CTGTGGCCGT 1560 

ps^ / 

*S 15 CTCCAGAGCT GCTCGGAGTG TATCCTGGCC CAGGACCOCG TGTGCGCCTG GAGCTTCCGG 1620 

*0 / 

- CTTGATGCTT GTGTGGCCCA CGCCGGCGAG CACCGCGGGA TGGTTCAAGA TATAGAGTCA 1680 

GCGGATGTCT CTTCTTTGTG TCCAAAAGAA CCTGGAGAAC ATCCCGTAGT GTTTGAAGTT 1740 

CCGGTGGCTA CTGTGGGCCA CGTGGTCCTG CCATGTTCCC CCAGTTCTGC CTGGGCATCC 1800 



TGTGTGTGGC ACCAGCCCAG TGGAGTGACT 



:TCACTC CCCGGAGGGA TGGACTAGAG 1860 



20 GTGGTGGTGA CCCCAGGGGC CATGGGGGCT TATGCTTGCG AGTGTCAGGA GGGTGGAGCC 1920 

GCCCGCGTGG TGGCTGCTTA TAGCTTGGTG TGGGGCAGCC AGCGGGGACC CTCAAACCGG 1980 

GCCCACACCG TTGTGGGGGC TGGATTGGTT / GGCTTTCTCC TGGGTGTTCT TGCAGCATCC 2040 

CTCACTCTCC TCCTGATTGG TCGCCGTCAP CAGCGTCGGC GACAGAGGGA GCTTCTAGCT 2100 

AGAGACAAGG TGGGCTTAGA TCTGGGGGCT CCACCTTCTG GGACCACAAG CTATAGTCAG 2160 

25 GACCCTCCCT CTCCTTCGCC TGAAGATdAA CGGCTGCCCC TGGCCCTGGG TAAGCGGGGC 2220 



■72- 



AGTGGTTTTG GTGGCTTCCC TCCACCCTTC CTGCTGGATT CTTGCCCAAG CCCAGCCC4 



2280 



ATCCGGCTCA CTGGGGCGCC TCTAGCCACG TGTGATGAGA CCTCCATCTA A 



2331 



nj 10 



15 



20 



SEQ ID NO: 3 

SEQUENCE LENGTH: 776 amino acids 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
ORIGINAL SOURCE: 

ORGANISM: rat (Rattus norvegicus) 

TISSUE TYPE: brain 
FEATURE: 

FEATURE KEY: peptide 

LOCATION: 1..776 

IDENTIFICATION METHOD: P 
SEQUENCE DESCRIPTION: 
Met Leu Ala Arg Ala Glu Arg Pro Arg firo Gly Pro Arg Pro Pro Pro 

1 5 / 10 15 

Val Phe Pro Phe Pro Pro Pro Leii Se^.Leu Leu Leu Leu Leu Ala lie 
"20 

Leu Ser Ala Pro Val Cys Gly Arg fal Pro Arg Ser Val Pro Arg Thr 

35 40/ 45 

Ser Leu Pro He Ser Glu Ala Aslp Ser Tyr Leu Thr Arg Phe Ala Ala 
50 55 / 60 



25 Ser His Thr Tyr Asn Tyr Ser 



la Leu Leu Val Asp Pro Ala Ser His 



73- 



65 



70 



75 



8C 



Thr Leu Tyr Val Gly Ala Arg Asp Ser lie Phe Ala Leu Thr Leu pro 

85 90 
Phe Ser Gly Glu Arg Pro Arg Arg He Asp Trp Met Val Pro/Glu Thr 

100 105 110 

His Arg Gin Asn Cys Arg Lys Lys Gly Lys Lys Glu Asp/Glu Cys His 

115 120 125 

Asn Phe He Gin He Leu Ala He Val Asn Ala Ser/His Leu Leu Thr 
130 135 

/ 

Cys Gly Thr Phe Ala Phe Asp Pro Lys Cys Gly^/Val He Asp Val Ser 
145 150 155 160 

Ser Phe Gin Gin Val Glu Arg Leu Glu Ser 6ly Arg Gly Lys Cys Pro 

165 176 175 

Phe Glu Pro Ala Gin Arg Ser Ala Ala yal Met Ala Gly Gly Val Leu 

180 185/ 190 

Tyr Thr Ala Thr Val Lys Asn Phe hJu Gly Thr Glu Pro He He Ser 

195 200 / 205 

Arg Ala Val Gly Arg Ala Glu Asp/ Trp He Arg Thr Glu Thr Leu Ser 

210 215 I 220 

Ser Trp Leu Asn Ala Pro Ala Phe Val Ala Ala Met Val Leu Ser Pro 
225 230 / 235 240 

Ala Glu Trp Gly Asp Glu Asto Gly Asp Asp Glu He Phe Phe Phe Phe 

245 / 250 255 

Thr Glu Thr Ser Arg Val Leu Asp Ser Tyr Glu Arg He Lys Val Pro 
260 / r 265 270 



• 
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/ 



Arg Val Ala Arg Val Cys Ala Gly Asp Leu Gly Gly Arg Lys Thr Let/ 

275 280 285 

Gin Gin Arg Trp Thr Thr Phe Leu Lys Ala Asp Leu Leu Cys Pi;6 Gly 
290 295 300 

5 Pro Glu His Gly Arg Ala Ser Gly Val Leu Gin Ala Met Ala Glu Leu 
305 310 315 / 320 

Arg Pro Gin Pro Gly Ala Gly Thr Pro He Phe Tyr Gly He Phe Ser 

325 330 / 335 

Ser Gin Trp Glu Gly Ala Ala He Ser Ala Val 9ys Ala Phe Arg Pro 
10 340 345 / 350 

Gin Asp He Arg Ala Val Leu Asn Gly Pro Rhe Arg Glu Leu Lys His 

355 360 / 365 

Asp Cys Asn Arg Gly Leu Pro Val Met Asp Asn Glu Val Pro Gin Pro 
370 375 / 380 

15 Arg Pro Gly Glu Cys He Ala Asn As/ Met Lys Leu Gin Gin Phe Gly 
385 390 / 395 400 

Ser Ser Leu Ser Leu Pro Asp Arm Val Leu Thr Phe He Arg Asp His 

405 / 410 415 

Pro Leu Met Asp Arg Pro Val yrhe Pro Ala Asp Gly Arg Pro Leu Leu 
20 420 / 425 430 

Val Thr Thr Asp Thr Ala yyr Leu Arg Val Val Ala His Arg Val Thr 

435 / 440 445 

Ser Leu Ser Gly Lys Gfu Tyr Asp Val Leu Tyr Leu Gly Thr Glu Asp 
450 / 455 460 

25 Gly His Leu His Arg/ Ala Val Arg He Gly Ala Gin Leu Ser Val Leu 



465 



470 



475 



480/ 



Glu Asp Leu Ala Leu Phe Pro Glu Pro Gin Pro Val Glu Ser Met l/s 

485 490 495 

Leu Tyr His Asp Trp Leu Leu Val Gly Ser His Thr Glu Val 'phr Gin 

500 505 51C 

Val Asn Thr Ser Asn Cys Gly Arg Leu Gin Ser Cys Ser (flu Cys lie 

515 520 52E 

Leu Ala Gin Asp Pro Val Cys Ala Trp Ser Phe Arg I^eu Asp Ala Cys 

530 535 540> 

Val Ala His Ala Gly Glu His Arg Gly Met Val Gl/n Asp He Glu Ser 
545 550 555 / 560 

Ala Asp Val Ser Ser Leu Cys Pro Lys Glu Pr</ Gly Glu His Pro Val 

565 570 / 575 

Val Phe Glu Val Pro Val Ala Thr Val Gly /His Val Val Leu Pro Cys 

580 585 / 590 

Ser Pro Ser Ser Ala Trp Ala Ser Cys Val Trp His Gin Pro Ser Gly 

595 600 / 605 

Val Thr Ala Leu Thr Pro Arg Arg Asp /Gly Leu Glu Val Val Val Thr 

610 615 7 620 

Pro Gly Ala Met Gly Ala Tyr Ala CVs Glu Cys Gin Glu Gly Gly Ala 
625 630 / 635 640 

Ala Arg Val Val Ala Ala Tyr Ser £eu Val Trp Gly Ser Gin Arg Gly 

645 / 650 655 

Pro Ser Asn Arg Ala His Thr Valf Val Gly Ala Gly Leu Val Gly Phe 
660 / 665 670 
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Leu Leu Gly Val Leu Ala Ala Ser Leu Thr Leu Leu Leu lie Gly Argy 

675 680 685 

Arg Gin Gin Arg Arg Arg Gin Arg Glu Leu Leu Ala Arg Asp Lys/Val 
690 695 700 

5 Gly Leu Asp Leu Gly Ala Pro Pro Ser Gly Thr Thr Ser Tyr/ Ser Gin 
705 710 715 / 720 

Asp Pro Pro Ser Pro Ser Pro Glu Asp Glu Arg Leu Pro 7 Leu Ala Leu 

725 730 / 735 

Gly Lys Arg Gly Ser Gly Phe Gly Gly Phe Pro Pro' Pro Phe Leu Leu 

740 745 / 750 

Asp Ser Cys Pro Ser Pro Ala His He Arg Leu/Thr Gly Ala Pro Leu 

755 760 / 765 

Ala Thr Cys Asp Glu Thr Ser He 
770 775 



10 



15 

SEQ ID NO: 4 

SEQUENCE LENGTH: 2315 bkse pairs 

SEQUENCE TYPE: nucleic a/cid 

STRANDEDNESS: double 
20 TOPOLOGY: linear 

MOLECULE TYPE: cDN^ to mRNA 

HYPOTHETICAL: No 

ANTI-SENSE: No 

ORIGINAL SOURCE: 
25 ORGANISM: human (Homo sapiens) 




i 



TISSUE TYPE: brain 



FEATURE: 



FEATURE KEY: CDS 



LOCATION: 1..1764 



IDENTIFICATION METHOD: E 



FEATURE KEY: 3' UTR 
LOCATION: 1765. .2315 
IDENTIFICATION METHOD: E 



jjj 10 SEQUENCE DESCRIPTION: 

> GGGGGTGTCC TCTATGCTGC CACTGTGAAA AACTACCTGG yGGACGGAGCC AATTATCACC 60 

©J AGAGCAGTGG GTCGTGCCGA GGACTGGATT CGGACAGATA CCTTGCCTTC CTGGCTGAAC 120 

I 

^ GCCCCAGCCT TTGTCGCAGC CGTGGCCTTG AGCCCAGOCG AATGGGGGGA TGAAGATGGA 180 

2 GACGACGAAA TCTACTTCTT CTTTACGGAG ACTTCCCGAG CATTTGACTC ATACGAGCGC 240 

LJ I 

ys / 

^ 15 ATTAAAGTCC CACGGGTGGC CCGTGTGTGT GCGGGGGACC TCGGGGGCCG GAAGACCCTC 300 

s.n / 

" CAGCAGAGAT GGACGACGTT TTTGAAAGCT GACJtTGCTCT GTCCAGGGCC TGAGCATGGC 360 

CGGGCCTCCA GTGTCCTGCA GGATGTTGCT GTOCTTCGAC CTGAGCTTGG GGCAGGGACT 420 

CCCATCTTTT ATGGCATCTT TTCTTCCCAG TGGGAGGGGG CTACTATCTC TGCTGTCTGT 480 

GCCTTCCGAC CACAAGACAT TCGGACAGTG/CTGAATGGTC CCTTCAGAGA ACTAAAACAT 540 

20 GACTGCAACA GAGGACTGCC TGTCGTGGAC AATGATGTGC CCCAGCCCAG ACCTGGAGAG 600 

TGCATCACCA ACAACATGAA GCTCCGGCAC TTTGGCTCAT CTCTCTCCCT GCCTGACCGC 660 

GTACTCACCT TCATCCGGGA CCACCCftCTC ATGGACAGGC CAGTGTTTCC AGCTGATGGC 720 

CACCCCCTGC TGGTCACTAC AGATAvAGCC TATCTCAGAG TCGTGGCCCA CAGGGTGACC 780 

AGCCTCTCAG GGAAAGAGTA TGATGTGCTC TACCTGGGGA CAGAGGATGG ACACCTCCAC 840 

25 CGAGCAGTGC GGATCGGAGC TCAGCTCAGC GTTCTTGAAG ATCTGGCCTT ATTCCCAGAG 900 



- =5^ 



m 
□ 

U3 
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CCACAGCCAG TTGAGAACAT GAAATTGTAC CACAGCTGGC TCCTGGTTGG CTCCCGTACT 960 

GAGGTGACAC AAGTGAATAC AACCAACTGT GGCCGTCTCC AGAGCTGCTC AGAGTGCATC 1020 

CTGGCCCAGG ACCCAGTCTG TGCCTGGAGC TTCCGGCTGG ATGAGTGTGT GGO^CATGCC 1080 

GGGGAGCACC GAGGGTTGGT CCAAGACATA GAGTCAGCAG ATGTCTCCTC TTTGTGTCCT 1140 

5 AAAGAGCCTG GAGAACGTCC AGTAGTGTTT GAAGTTCCCG TGGCTACAGy TGCGCATGTG 1200 

GTCTTGCCAT GTTCTCCAAG CTCAGCATGG GCATCCTGTG TGTGGCAGCA GCCCAGTGGA 1260 

GTGACTGCAC TCACCCCCCG GCGGGATGGA CTGGAGGTGG TGGTGACCCC AGGGGCCATG 1320 

GGCGCTTATG CCTGTGAATG TCAGGAGGGT GGGGCAGCCC ATGTGGTAGC AGCTTACAGC 1380 

TTGGTATGGG GCAGCCAGCG AGATGCTCCG AGCCGGGCCC ACACAGTGGG GGCGGGACTG 1440 

10 GCTGGCTTCT TCTTGGGGAT TCTCGCAGCA TCCCTGACTC JCATTCTGAT TGGTCGGCGT 1500 

CAGCAGCGAC GGCGACAGAG GGAACTTCTG GCTAGAGACA' AGGTGGGCCT GGACCTGGGG 1560 

/ 

GCTCCACCTT CTGGGACCAC AAGCTACAGC CAAGACCCTC CCTCCCCCTC TCCTGAAGAT 1620 

GAGCGGTTGC CGCTGGCCCT GGCCAAGAGG GGCAGTGGCT TTGGTGGATT CTCACCACCC 1680 

TTCCTGCTTG ATCCTTGCCC AAGCCCAGCC CACATTCGGC TAACTGGGGC TCCTCTAGCC 1740 

15 ACATGTGATG AAACATCCAT CTAGAGCTGG GCAAATGACC ACTAGTGTAT AAGTGATCAC 1800 

CTGAGTTCTC TTTGAGTATG AGTGATTACT TGQATTTTAG TATCTGTTCT CTCTGAGCCT 1920 



TGGAACGGAG TGACCACTGA GATGCTGGGG GTCACTGGGC CTGGAAGACC ATCCCAGCCT 1860 



GGATGGGCTT GGGGCCAGAC CTTTGCCTGA T/TCCTGATTC CCATGAGAAA TCAGAACTGC 1980 

TTTCTGCAGC AAATCAGGGC TTCCCCCTAA CATCTGAACT CCTGTAAACC TTCATCCCTG 2040 

20 GCCCCCTATC TTGGGCCCAT TAGTTTTGGG /GATGGGGCAC AGGGCATAGC TATGACTTTG 2100 

CTTTCTGGTT GGAGCCTGGC CGGAAGGAAGl AGCCCTGGAG GTGGTTGGGG GCAAATGTGC 2160 

CCTGAGTCCT TGGGGTGGTT CTGCTTATTO TTCAAGTTTA TCTGAATCTG TGGGGAGTGC 2220 

ATGATCCCCA TGTTGCAATA TGGAGTCTCT GCCCTGAGAT CTTCCCCATC TCAGTTTTCC 2280 

TTCCATGAAA GAGTACGTGT AAATACAT4G TGTTC 2315 

25 




\ 10 
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SEQ ID NO: 5 

SEQUENCE LENGTH: 1761 base pairs 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL: No 
ANTI-SENSE: No 
ORIGINAL SOURCE: 

ORGANISM: human (Homo sapiens) 

TISSUE TYPE: brain 
FEATURE: 

FEATURE KEY: CDS 

LOCATION: 1..1761 

IDENTIFICATION METHOD: E 
SEQUENCE DESCRIPTION: 

GGGGGTGTCC TCTATGCTGC CACTGTGAAA AACTACCTGG/ GGACGGAGCC AATTATCACC 



AGAGCAGTGG GTCGTGCCGA GGACTGGATT CGGACAGATA CCTTGCCTTC CTGGCTGAAC 120 

GCCCCAGCCT TTGTCGCAGC CGTGGCCTTG AGCCCAGQCG AATGGGGGGA TGAAGATGGA 180 

20 GACGACGAAA TCTACTTCTT CTTTACGGAG ACTTCCCGAG CATTTGACTC ATACGAGCGC 240 

ATTAAAGTCC CACGGGTGGC CCGTGTGTGT GCGGGGGACC TCGGGGGCCG GAAGACCCTC 300 

CAGCAGAGAT GGACGACGTT TTTGAAAGCT GACCTGCTCT GTCCAGGGCC TGAGCATGGC 360 

CGGGCCTCCA GTGTCCTGCA GGATGTTGCT GTGQTTCGAC CTGAGCTTGG GGCAGGGACT 420 

CCCATCTTTT ATGGCATCTT TTCTTCCCAG TGGGAGGGGG CTACTATCTC TGCTGTCTGT 480 

25 GCCTTCCGAC CACAAGACAT TCGGACAGTG CTjGAATGGTC CCTTCAGAGA ACTAAAACAT 540 





60 
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GACTGCAACA GAGGACTGCC TGTCGTGGAC AATGATGTGC CCCAGCCCAG ACCTGGAGAJ5 600 

TGCATCACCA ACAACATGAA GCTCCGGCAC TTTGGCTCAT CTCTCTCCCT GCCTGAGCGC 660 

GTACTCACCT TCATCCGGGA CCACCCACTC ATGGACAGGC CAGTGTTTCC AGCTGATGGC 720 

CACCCCCTGC TGGTCACTAC AGATACAGCC TATCTCAGAG TCGTGGCCCA CAGGGTGACC 780 

5 AGCCTCTCAG GGAAAGAGTA TGATGTGCTC TACCTGGGGA CAGAGGATGG^CACCTCCAC 840 

CGAGCAGTGC GGATCGGAGC TCAGCTCAGC GTTCTTGAAG ATCTGGC^TT ATTCCCAGAG 900 

CCACAGCCAG TTGAGAACAT GAAATTGTAC CACAGCTGGC TCCTGGTTGG CTCCCGTACT 960 

/ 

GAGGTGACAC AAGTGAATAC AACCAACTGT GGCCGTCTCC AGAGCTGCTC AGAGTGCATC 1020 

CTGGCCCAGG ACCCAGTCTG TGCCTGGAGC TTCCGGCTGG ATGAGTGTGT GGCCCATGCC 1080 

10 GGGGAGCACC GAGGGTTGGT CCAAGACATA GAGTCAGCAG ATGTCTCCTC TTTGTGTCCT 1140 

AAAGAGCCTG GAGAACGTCC AGTAGTGTTT GMGTTCCCG^TGGCTACAGC TGCGCATGTG 1200 

GTCTTGCCAT GTTCTCCAAG CTCAGCATGG GCATCCTGT£ TGTGGCACCA GCCCAGTGGA 1260 

GTGACTGCAC TCACCCCCCG GCGGGATGGA CTGGAG^TGG TGGTGACCCC AGGGGCCATG 1320 

GGCGCTTATG CCTGTGAATG TCAGGAGGGT GGGGCAGCCC ATGTGGTAGC AGCTTACAGC 1380 

15 TTGGTATGGG GCAGCCAGCG AGATGCTCCG AGCGGGGCCC ACACAGTGGG GGCGGGACTG 1440 

GCTGGCTTCT TCTTGGGGAT TCTCGCAGCA TCCCTGACTC TCATTCTGAT TGGTCGGCGT 1500 

/ 

CAGCAGCGAC GGCGACAGAG GGAACTTCTG^GCTAGAGACA AGGTGGGCCT GGACCTGGGG 1560 

GCTCCACCTT CTGGGACCAC AAGCTACAGG CAAGACCCTC CCTCCCCCTC TCCTGAAGAT 1620 

/ 

GAGCGGTTGC CGCTGGCCCT GGCCAAGAGG GGCAGTGGCT TTGGTGGATT CTCACCACCC 1680 

20 TTCCTGCTTG ATCCTTGCCC AAGCCCAGCC CACATTCGGC TAACTGGGGC TCCTCTAGCC 1740 

ACATGTGATG AAACATCCAT C / 1761 



SEQ ID NO: 6 
SEQUENCE LENGTH:1 587 amino acids 
25 SEQUENCE TYPE: aJino acid 




TOPOLOGY: linear 
MOLECULE TYPE: peptide 
ORIGINAL SOURCE: 

ORGANISM: human (Homo sapiens) 
5 TISSUE TYPE: brain 
FEATURE: 

FEATURE KEY: peptide 
LOCATION: 1..587 
„ IDENTIFICATION METHOD: P 

m 10 SEQUENCE DESCRIPTION: 

iT= 

£ Gly Gly Val Leu Tyr Ala Ala Thr Val Lys Asn Tyr /Leu Gly Thr Glu 
N= / 
CO 5 10/15 



Pro lie He Thr Arg Ala Val Gly Arg Ala Glu Afep Trp lie Arg Thr 

20 25 / 30 

/ 

15 Asp Thr Leu Pro Ser Trp Leu Asn Ala Pro Aid Phe Val Ala Ala Val 
~ 35 40 / 45 

Ala Leu Ser Pro Ala Glu Trp Gly Asp Glu Asp Gly Asp Asp Glu He 

50 55 / 60 

Tyr Phe Phe Phe Thr Glu Thr Ser Arg Ala* Phe Asp Ser Tyr Glu Arg 
20 65 70 / 75 80 

He Lys Val Pro Arg Val Ala Arg Val CVs Ala Gly Asp Leu Gly Gly 

85 90 95 

Arg Lys Thr Leu Gin Gin Arg Trp Thr ihr Phe Leu Lys Ala Asp Leu 



100 



105 



110 



25 Leu Cys Pro Gly Pro Glu His Gly Arg Ala Ser Ser Val Leu Gin Asp 



•82- 



115 



120 



125 



Val Ala Val Leu Arg Pro Glu Leu Gly Ala Gly Thr Pro He Phe Ty| 

130 135 140 

Gly He Phe Ser Ser Gin Trp Glu Gly Ala Thr He Ser Ala Val/Cys 
5 145 150 155 / 160 

Ala Phe Arg Pro Gin Asp He Arg Thr Val Leu Asn Gly Pro frhe Arg 

165 170 / 175 

Glu Leu Lys His Asp Cys Asn Arg Gly Leu Pro Val Val Asp Asn Asp 
180 185 / 190 

10 Val Pro Gin Pro Arg Pro Gly Glu Cys He Thr Asn Afen Met Lys Leu 
195 200 /205 

Arg His Phe Gly Ser Ser Leu Ser Leu Pro Asp Arg Val Leu Thr Phe 

210 215 220 

He Arg Asp His Pro Leu Met Asp Arg Pro Val tfhe Pro Ala Asp Gly 
15 225 230 * 235/ 240 

His Pro Leu Leu Val Thr Thr Asp Thr Ala Tyif Leu Arg Val Val Ala 

245 250 / 255 

His Arg Val Thr Ser Leu Ser Gly Lys Glu fyr Asp Val Leu Tyr Leu 
260 265 / 270 

20 Gly Thr Glu Asp Gly His Leu His Arg Ala/ Val Arg He Gly Ala Gin 
275 280 / 285 

Leu Ser Val Leu Glu Asp Leu Ala Leu Phe Pro Glu Pro Gin Pro Val 



290 



295 



Glu Asn Met Lys Leu Tyr His Ser Trp L 
25 305 310 



300 



eu Leu Val Gly Ser Arg Thr 



315 



320 
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Glu Val Thr Gin Val Asn Thr Thr Asn Cys Gly Arg Leu Gin Ser Cys 

325 330. 335 

Ser Glu Cys He Leu Ala Gin Asp Pro Val Cys Ala Trp Ser Phe Arg 
340 345 350 

5 Leu Asp Glu Cys Val Ala His Ala Gly Glu His Arg Gly Leu Val /fcln 
355 360 365 

Asp He Glu Ser Ala Asp Val Ser Ser Leu Cys Pro Lys Glu/ Pro Gly 

370 375 380 

Glu Arg Pro Val Val Phe Glu Val Pro Val Ala Thr Ala/ Ala His Val 
10 385 390 395 / 400 

Val Leu Pro Cys Ser Pro Ser Ser Ala Trp Ala Ser/ Cys Val Trp His 

405 410 / 415 

Gin Pro Ser Gly Val Thr Ala Leu Thr Pro Argf Arg Asp Gly Leu Glu 

'H 420 425 / 430 

Lj 

2 15 Val Val Val Thr Pro Gly Ala Met Gly Ala' Tyr Ala Cys Glu Cys Gin 
435 440 / 445 

Glu Gly Gly Ala Ala His Val Val Ala' Ala Tyr Ser Leu Val Trp Gly 

450 455 / 460 

Ser Gin Arg Asp Ala Pro Ser Arg/Ala His Thr Val Gly Ala Gly Leu 
20 465 470 / 475 480 

Ala Gly Phe Phe Leu Gly He lieu Ala Ala Ser Leu Thr Leu He Leu 

485 / 490 495 

He Gly Arg Arg Gin Gin Ar£ Arg Arg Gin Arg Glu Leu Leu Ala Arg 
500 / 505 510 

25 Asp Lys Val Gly Leu Asp lieu Gly Ala/ Pro Pro Ser Gly Thr Thr Ser 



u 
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515 



520 



525 



Tyr Ser Gin Asp Pro Pro Ser Pro Ser Pro Glu Asp Glu Arg Leu P/o 

530 535 540 

Leu Ala Leu Ala Lys Arg Gly Ser Gly Phe Gly Gly Phe Ser P/o Pro 
545 550 555 / 560 

Phe Leu Leu Asp Pro Cys Pro Ser Pro Ala His He Arg Le/i Thr Gly 

565 570 / 575 

Ala Pro Leu Ala Thr Cys Asp Glu Thr Ser He 

580 585 



SEQ ID NO: 7 

SEQUENCE LENGTH: 196 base pairs 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL: No 
ANTI-SENSE: No 
ORIGINAL SOURCE: 

ORGANISM: human (Homo sap/ens) 

TISSUE TYPE: brain 
FEATURE: 

FEATURE KEY: CDS 



LOCATION: 1..196 



IDENTIFICATION METHOD E 



-85- 

SEQUENCE DESCRIPTION: 

AAATTGTACC ACAGCTGGCT CCTGGTTGGC TCCCGTACTG AGGTGACACA AGYGAATACA 60 
ACCAACTGTG GCCGTCTCCA GAGCTGCTCA GAGTGCATCC TGGCCCAGGA QCCAGTCTGT 120 
GCCTGGAGCT TCCGGCTGGA TGAGTGTGTG GCCCATGCCG GGGAGCACCg/aGGGTTGGTC 180 



CAAGACATAG AGTCAG 



196 



SEQ ID NO: 8 

SEQUENCE LENGTH: 30 base pairs 
SEQUENCE TYPE: nucleic acid 
10 STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid, /synthetic DNA 
HYPOTHETICAL: No 
ANTI-SENSE: No 
,i 15 SEQUENCE DESCRIPTION: 
~ GATAAGGATC CGGGTCGCCG TCAGCAGCGT 



n 



Sa=? 



30 



SEQ ID NO: 9 

SEQUENCE LENGTH: 27 base pair* 
20 SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic Jicid, synthetic DNA 
HYPOTHETICAL: No 
25 ANTI-SENSE: Yes I ^^""^ 



SEQUENCE DESCRIPTION: 
GGCTGGAATT CATTTTCCCC GGCTTTA 



27 



fn 



1=1 



SEQ ID NO: 10 
5 SEQUENCE LENGTH: 333 base pairs 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mR#A 

ri 

f, 10 HYPOTHETICAL: No 
j= ANTI-SENSE: No 

ORIGINAL SOURCE: 

ORGANISM: human (Hom/> sapiens) 
TISSUE TYPE: brain 
| 15 FEATURE: 

FEATURE KEY: CDS 
LOCATION: 1..333 
IDENTIFICATION METHOD: E 
SEQUENCE DESCRIPTION: 
20 CCCCGGCCGG GTCCCGGGCA GCCTACAGCC TCGCCCTTCC CGCTACTGCT GCTGGCGGTG 
CTGAGCGGCC CGGTATCCGG CCGCGTCCCC CGCTCGGTGC CCAGAACCTC GCTTCCAATC 
TCTGAGGCTG ACTTCTGTCT CACuCGGTTC GCAGTCCCTC ACACATACAA TTACTCTGTT 
CTCCTTGTGG ATCCTGCCTC CCACACACTT TATGTTGGCG CCCGGGACAC CATCTTCGCT 
TTATCCCTGC CCTTCTCAGG GGAGAGACCC CGCAGGATTG ACTGGATGGT TCCTGAGGCT 
25 CACAGACAGA ACTGTAGGAA GMAGGCAAG AAA 



60 
120 
180 
240 
300 
333 
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SEQ ID NO: 11 

SEQUENCE LENGTH: 111 amino acids 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
ORIGINAL SOURCE: 

ORGANISM: human (Homo sapiens) 
TISSUE TYPE: brain 
FEATURE: 

FEATURE KEY: peptide 
LOCATION: 1..111 
IDENTIFICATION METHOD: 
SEQUENCE DESCRIPTION: 
Pro Arg Pro Gly Pro Gly Gin Pro Thf Ala Ser Pro Phe Pro Leu Leu 

5 / 10 15 

Leu Leu Ala Val Leu Ser Gly Pro fVal Ser Gly Arg Val Pro Arg Ser 

20 / 25 30 

Val Pro Arg Thr Ser Leu Pro lie Ser Glu Ala Asp Phe Cys Leu Thr 

35 /40 45 

Arg Phe Ala Val Pro His Thr/Tyr Asn Tyr Ser Val Leu Leu Val Asp 

50 53 60 

Pro Ala Ser His Thr Leu Tyf Val Gly Ala Arg Asp Thr He Phe Ala 
65 70 / 75 80 



Leu Ser Leu Pro Phe Ser G 



y Glu Arg Pro Arg Arg He Asp Trp Met 



a- 7 



• 
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85 



90 



95 



Val Pro Glu Ala His Arg Gin Asn Cys Arg Lys Lys Gly L^s Lys 
100 105 110 




SEQUENCE LISTING 

<110> KIMURA, Toru 

KIKUCHI, Kaoru 

<120> NOVEL SEMAPHORIN GENE: SEMAPHORIN W 

<130> 20-4546P 

<140> 
<141> 

<160> 14 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 4008 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> 5'UTR 
<222> (1) . . (75) 

<220> 

<223> Coding region from residue 76 to 2406 
<220> 

<221> 3 1 UTR 
<222> () . . (3977) 

<220> 

<221> polyA_signal 
<222> () . . (4008) 

<400> 1 

gccgaggccc gcgcagtagc ggtactaagt agaggctgct ggacgcgccc cacccggcac 60 
caggcggagc cagagatgct tgccagggcc gagcggcccc gcccgggccc ccggccgcct 120 
ccggtctttc ccttcccgcc gccgctgtcg ctgctgctgc tgctggcgat actaagcgcc 180 
ccggtgtgcg gccgcgtccc ccgctcagtg cccagaacct cgctgcccat ctccgaggct 240 
gactcctatc tcacccggtt tgcagcgtct catacgtaca attactctgc tctccttgtg 300 
gatcctgcct cccacacact ttacgtcggt gcacgggata gcatcttcgc tttaaccctc 360 
cccttctctg gggaaagacc ccgaaggatc gactggatgg tacctgagac tcacagacag 420 
aactgcagga agaaaggcaa gaaagaggac gaatgtcaca attttatcca gattctcgcc 480 
attgtcaatg cctctcacct cctcacgtgc ggcaccttcg cttttgatcc gaagtgcggg 540 
gttattgatg tgtccagttt ccagcaggtt gaaagacttg agagcggccg ggggaaatgt 600 
ccttttgagc cagctcaacg gtcagcagct gtaatggctg ggggcgtcct ctacaccgcc 660 
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actgtgaaga acttcctggg gactgagccc atcatctccc gagctgtggg tcgagctgag 720 
gactggattc gaacagagac cttgtcatcc tggcttaatg ctccagcctt tgtcgcagct 780 
atggtcctga gcccagctga gtggggggat gaagatggag acgatgaaat cttttttttc 840 
ttcacggaga cctcccgagt gttggactcc tatgagcgca tcaaggtccc aagagtggcc 900 
cgagtgtgtg cgggggacct tgggggcagg aagacccttc agcagagatg gacgacgttt 960 
ctgaaggctg acctgctgtg cccagggccc gagcatggcc gggcctccgg ggttctgcag 1020 
gctatggcag agcttcggcc tcagcctgga gcgggaaccc ccatctttta tgggatcttt 1080 
tcctcccaNgt gggaaggagc tgccatctct gctgtgtgtg ccttccgacc ccaagacatc 1140 
cgggcagtgc tgaatggtcc ctttagagag ctaaaacatg actgcaacag gggactgcct 1200 
gtcatggaca acgaggtgcc ccagcccaga cctggagagt gcatcgccaa caacatgaag 1260 
ctccagcagt ttggatcctc actctccctg ccagaccgcg tgctcacctt tatcagagac 1320 
caccctctca tggacaggcc cgtgttcccg gctgacggcc gccccctgct ggtcactaca 1380 
gatacagcct atctcagagt cgtggcccac agggtgacca gcctctcagg gaaagaatat 1440 
gacgtgctct acctggggac agaggatgga cacctccacc gggctgtgcg cattggagct 1500 
cagctcagtg tcttggagga tctggccttg ttcccagaac cacagccggt tgagagcatg 1560 
aaattgtacc acgattggct cctggtgggc tcccatactg aggtgacaca agtgaacacc 1620 
agcaactgtg gccgtctcca gagctgctcg gagtgtatcc tggcccagga ccccgtgtgc 1680 
gcctggagct tccggcttga tgcttgtgtg gcccacgccg gcgagcaccg cgggatggtt 1740 
caagatatag agtcagcgga tgtctcttct ttgtgtccaa aagaacctgg agaacatccc 1800 
gtagtgtttg aagttccggt ggctactgtg ggccacgtgg tcctgccatg ttcccccagt 1860 
tctgcctggg catcctgtgt gtggcaccag cccagtggag tgactgcgct cactccccgg 1920 
agggatggac tagaggtggt ggtgacccca ggggccatgg gggcttatgc ttgcgagtgt 1980 
caggagggtg gagccgcccg cgtggtggct gcttatagct tggtgtgggg cagccagcgg 2040 
ggaccctcaa accgggccca caccgttgtg ggggctggat tggttggctt tctcctgggt 2100 
gttcttgcag catccctcac tctcctcctg attggtcgcc gtcagcagcg tcggcgacag 2160 
agggagcttc tagctagaga caaggtgggc ttagatctgg gggctccacc ttctgggacc 2220 
acaagctata gtcaggaccc tccctctcct tcgcctgaag atgaacggct gcccctggcc 2280 
ctgggtaagc ggggcagtgg ttttggtggc ttccctccac ccttcctgct ggattcttgc 2340 
ccaagcccag cccacatccg gctcactggg gcgcctctag ccacgtgtga tgagacctcc 2400 
atctaaagcc ggggaaaatg actgccagcc atgagcagtc tctggaacta gtggctacca 2460 
agaccatgat catggctgct cctttctctt ggagtctgtg tgttcacaca ttagtgtctg 2520 
tcctctggac ctggacctgg cctttgccca gattcctgat tctcatgaga gatcaaccct 2580 
gtaaccttct gcgatggcct cttgtcttgg gcccatcagc ttgtggggtg gagtaaggac 2640 
ataggccccg gaaagggaat cagtgtggag gtagttgggg cgtgtgtgcc ctgcgtcctt 2700 
gtggtggctg tatgatttcc cagtctgctg actctgggga gcgcatgatc ccctgactgc 2760 
cttgagatct ctcccaactc agtttcccct tgctctggaa gagtgtgtgt ctatacactg 2820 
gtgtgcctag aaggcctgtc catgtgtgca tggacgacag ggccggtgcc tcggtgcttt 2880 
tggggagtcg gagagaaagg ttggaatggg ggacaactta accctcggta gccagtgagg 2940 
gaaaccacat gcccgtcccc atcaccccac agcgcttctt taactttgag caaagttccc 3000 
aaagtgacct tctgggtggg aagggcagca ggacatgtgg cccccgtcct tctccttgtc 3060 
tttcccttct ggctgccaac cactgccgtg ccaccgctgc gctttccctg gctggagtgg 3120 
aggctgagtc ctctgtcctt ggtttccatt taaaatgaac ttcacaacat tctaaatatt 3180 
gggggatgac aaatgacttt tttccccaga aaagtgtgta ggaaatacaa gcaggttaaa 3240 
gaagatttgc ctcagtgact ttcacccttg ccctaaagca ggagtccctc agctagcgtc 3300 
tgtggactcc ctgaaattgt atgcgtctgt ggactccctg aaattgtatg caaagtgtct 3360 
gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtttgcgtgc atgtgtgcat 3420 
gtgtgtttga tggctttcat cagattctca aggccttaat gaggttaaag gaccacggcc 3480 
tatagtcacc acacttgggc cacatggagg aggtgttgct ctctgaggca gttcctccct 3540 
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ggcctgcctg aggccagccc ctggacacat tgctgctgga gaccccacat ctctccagaa 3600 

cttggaagct aggctctgcg cgtgcttgaa ggcaccacca tctcccttct tgcttcattc 3660 

tcctgtgtgc tctgcctctg ctcagtcctg ctcttggcct gtgaatgtgc ctcgcccgtc 3720 

cctggtgggg gacctcaaac cccagtgctg atgctaccct ttccagtggg agtttctgtt 3780 

ctgctttcct tgacagcagc ctgtgaacta ctcacgagtc cccttggttt ggagttcccg 3840 

gtggctttga gtaggatctt tggcgtggca tctaacctag cagcattgat cgttcattgt 3900 

aaagtgggga tatacctacc tcagggttgc tgcaaggatc aaacgaggaa acgtataaat 3960 
aaagcattac ccacagcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 4008 

<210> 2 
<211> 2331 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<223> Coding region from residue 1 to 2331 
<400> 2 

atgcttgcca gggccgagcg gccccgcccg ggcccccggc cgcctccggt ctttcccttc 60 

ccgccgccgc tgtcgctgct gctgctgctg gcgatactaa gcgccccggt gtgcggccgc 120 

gtcccccgct cagtgcccag aacctcgctg cccatctccg aggctgactc ctatctcacc 180 

cggtttgcag cgtctcatac gtacaattac tctgctctcc ttgtggatcc tgcctcccac 240 

acactttacg tcggtgcacg ggatagcatc ttcgctttaa ccctcccctt ctctggggaa 300 

agaccccgaa ggatcgactg gatggtacct gagactcaca gacagaactg caggaagaaa 360 

ggcaagaaag aggacgaatg tcacaatttt atccagattc tcgccattgt caatgcctct 420 

cacctcctca cgtgcggcac cttcgctttt gatccgaagt gcggggttat tgatgtgtcc 480 

agtttccagc aggttgaaag acttgagagc ggccggggga aatgtccttt tgagccagct 540 

caacggtcag cagctgtaat ggctgggggc gtcctctaca ccgccactgt gaagaacttc 600 

ctggggactg agcccatcat ctcccgagct gtgggtcgag ctgaggactg gattcgaaca 660 

gagaccttgt catcctggct taatgctcca gcctttgtcg cagctatggt cctgagccca 720 

gctgagtggg gggatgaaga tggagacgat gaaatctttt ttttcttcac ggagacctcc 780 

cgagtgttgg actcctatga gcgcatcaag gtcccaagag tggcccgagt gtgtgcgggg 840 

gaccttgggg gcaggaagac ccttcagcag agatggacga cgtttctgaa ggctgacctg 900 

ctgtgcccag ggcccgagca tggccgggcc tccggggttc tgcaggctat ggcagagctt 960 

cggcctcagc ctggagcggg aacccccatc ttttatggga tcttttcctc ccagtgggaa 1020 

ggagctgcca tctctgctgt gtgtgccttc cgaccccaag acatccgggc agtgctgaat 1080 

ggtcccttta gagagctaaa acatgactgc aacaggggac tgcctgtcat ggacaacgag 1140 

gtgccccagc ccagacctgg agagtgcatc gccaacaaca tgaagctcca gcagtttgga 1200 

tcctcactct ccctgccaga ccgcgtgctc acctttatca gagaccaccc tctcatggac 1260 

aggcccgtgt tcccggctga cggccgcccc ctgctggtca ctacagatac agcctatctc 1320 

agagtcgtgg cccacagggt gaccagcctc tcagggaaag aatatgacgt gctctacctg 1380 

gggacagagg atggacacct ccaccgggct gtgcgcattg gagctcagct cagtgtcttg 1440 

gaggatctgg ccttgttccc agaaccacag ccggttgaga gcatgaaatt gtaccacgat 1500 

tggctcctgg tgggctccca tactgaggtg acacaagtga acaccagcaa ctgtggccgt 1560 

ctccagagct gctcggagtg tatcctggcc caggaccccg tgtgcgcctg gagcttccgg 1620 

cttgatgctt gtgtggccca cgccggcgag caccgcggga tggttcaaga tatagagtca 1680 

gcggatgtct cttctttgtg tccaaaagaa cctggagaac atcccgtagt gtttgaagtt 1740 

ccggtggcta ctgtgggcca cgtggtcctg ccatgttccc ccagttctgc ctgggcatcc 1800 
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tgtgtgtggc accagcccag tggagtgact gcgctcactc cccggaggga tggactagag 1860 

gtggtggtga ccccaggggc catgggggct tatgcttgcg agtgtcagga gggtggagcc 1920 

gcccgcgtgg tggctgctta tagcttggtg tggggcagcc agcggggacc ctcaaaccgg 1980 

gcccacaccg ttgtgggggc tggattggtt ggctttctcc tgggtgttct tgcagcatcc 2040 

ctcactctcc tcctgattgg tcgccgtcag cagcgtcggc gacagaggga gcttctagct 2100 

agagacaagg tgggcttaga tctgggggct ccaccttctg ggaccacaag ctatagtcag 2160 

gaccctccct ctccttcgcc tgaagatgaa cggctgcccc tggccctggg taagcggggc 2220 

agtggttttg gtggcttccc tccacccttc ctgctggatt cttgcccaag cccagcccac 2280 

atccggctca ctggggcgcc tctagccacg tgtgatgaga cctccatcta a 2331 

<210> 3 
<211> 776 
<212> PRT 

<213> Rattus norvegicus 
<400> 3 

Met Leu Ala Arg Ala Glu Arg Pro Arg Pro Gly Pro Arg Pro Pro Pro 
1 5 10 15 

Val Phe Pro Phe Pro Pro Pro Leu Ser Leu Leu Leu Leu Leu Ala lie 
20 25 30 

Leu Ser Ala Pro Val Cys Gly Arg Val Pro Arg Ser Val Pro Arg Thr 
35 40 45 

Ser Leu Pro lie Ser Glu Ala Asp Ser Tyr Leu Thr Arg Phe Ala Ala 
50 55 60 

Ser His Thr Tyr Asn Tyr Ser Ala Leu Leu Val Asp Pro Ala Ser His 
65 70 75 80 

Thr Leu Tyr Val Gly Ala Arg Asp Ser lie Phe Ala Leu Thr Leu Pro 
85 90 95 

Phe Ser Gly Glu Arg Pro Arg Arg lie Asp Trp Met Val Pro Glu Thr 
100 105 110 

His Arg Gin Asn Cys Arg Lys Lys Gly Lys Lys Glu Asp Glu Cys His 
115 120 125 

Asn Phe lie Gin lie Leu Ala lie Val Asn Ala Ser His Leu Leu Thr 
130 135 140 

Cys Gly Thr Phe Ala Phe Asp Pro Lys Cys Gly Val lie Asp Val Ser 
145 150 155 160 

Ser Phe Gin Gin Val Glu Arg Leu Glu Ser Gly Arg Gly Lys Cys Pro 
165 170 175 
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Phe Glu Pro Ala Gin Arg Ser Ala Ala Val Met Ala Gly Gly Val Leu 
180 185 190 



Tyr Thr Ala Thr Val Lys Asn Phe Leu Gly Thr Glu Pro lie lie Ser 
195 200 205 

Arg Ala Val Gly Arg Ala Glu Asp Trp lie Arg Thr Glu Thr Leu Ser 
210 215 220 

Ser Trp Leu Asn Ala Pro Ala Phe Val Ala Ala Met Val Leu Ser Pro 
225 230 235 240 

Ala Glu Trp Gly Asp Glu Asp Gly Asp Asp Glu lie Phe Phe Phe Phe 
245 250 255 

Thr Glu Thr Ser Arg Val Leu Asp Ser Tyr Glu Arg lie Lys Val Pro 
260 265 270 

Arg Val Ala Arg Val Cys Ala Gly Asp Leu Gly Gly Arg Lys Thr Leu 
275 280 285 

Gin Gin Arg Trp Thr Thr Phe Leu Lys Ala Asp Leu Leu Cys Pro Gly 
290 295 300 

Pro Glu His Gly Arg Ala Ser Gly Val Leu Gin Ala Met Ala Glu Leu 
305 310 315 320 

Arg Pro Gin Pro Gly Ala Gly Thr Pro He Phe Tyr Gly He Phe Ser 
325 330 335 

Ser Gin Trp Glu Gly Ala Ala He Ser Ala Val Cys Ala Phe Arg Pro 
340 345 350 

Gin Asp He Arg Ala Val Leu Asn Gly Pro Phe Arg Glu Leu Lys His 
355 360 365 

Asp Cys Asn Arg Gly Leu Pro Val Met Asp Asn Glu Val Pro Gin Pro 
370 375 380 

Arg Pro Gly Glu Cys He Ala Asn Asn Met Lys Leu Gin Gin Phe Gly 
385 390 395 400 

Ser Ser Leu Ser Leu Pro Asp Arg Val Leu Thr Phe He Arg Asp His 
405 410 415 



Pro Leu Met Asp Arg Pro Val Phe Pro Ala Asp Gly Arg Pro Leu Leu 
420 425 430 
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Val Thr Thr Asp Thr Ala Tyr Leu Arg Val Val Ala His Arg Val Thr 
435 440 445 

Ser Leu Ser Gly Lys Glu Tyr Asp Val Leu Tyr Leu Gly Thr Glu Asp 
450 455 460 

Gly His Leu His Arg Ala Val Arg lie Gly Ala Gin Leu Ser Val Leu 
465 470 475 480 

Glu Asp Leu Ala Leu Phe Pro Glu Pro Gin Pro Val Glu Ser Met Lys 
485 490 495 

Leu Tyr His Asp Trp Leu Leu Val Gly Ser His Thr Glu Val Thr Gin 
500 505 510 

Val Asn Thr Ser Asn Cys Gly Arg Leu Gin Ser Cys Ser Glu Cys lie 
515 520 525 

Leu Ala Gin Asp Pro Val Cys Ala Trp Ser Phe Arg Leu Asp Ala Cys 
530 535 540 

Val Ala His Ala Gly Glu His Arg Gly Met Val Gin Asp lie Glu Ser 
545 550 555 560 

Ala Asp Val Ser Ser Leu Cys Pro Lys Glu Pro Gly Glu His Pro Val 
565 570 575 

Val Phe Glu Val Pro Val Ala Thr Val Gly His Val Val Leu Pro Cys 
580 585 590 

Ser Pro Ser Ser Ala Trp Ala Ser Cys Val Trp His Gin Pro Ser Gly 
595 600 605 

Val Thr Ala Leu Thr Pro Arg Arg Asp Gly Leu Glu Val Val Val Thr 
610 615 620 

Pro Gly Ala Met Gly Ala Tyr Ala Cys Glu Cys Gin Glu Gly Gly Ala 
625 630 635 640 

Ala Arg Val Val Ala Ala Tyr Ser Leu Val Trp Gly Ser Gin Arg Gly 
645 650 655 

Pro Ser Asn Arg Ala His Thr Val Val Gly Ala Gly Leu Val Gly Phe 
660 665 670 



Leu Leu Gly Val Leu Ala Ala Ser Leu Thr Leu Leu Leu lie Gly Arg 
675 680 685 
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Arg Gin Gin Arg 
690 

Gly Leu Asp Leu 
705 

Asp Pro Pro Ser 



Gly Lys Arg Gly 
740 

Asp Ser Cys Pro 
755 

Ala Thr Cys Asp 
770 



Arg Arg Gin Arg 
695 

Gly Ala Pro Pro 
710 

Pro Ser Pro Glu 
725 

Ser Gly Phe Gly 



Ser Pro Ala His 
760 

Glu Thr Ser lie 
775 



Glu Leu Leu Ala 
700 

Ser Gly Thr Thr 
715 

Asp Glu Arg Leu 
730 

Gly Phe Pro Pro 
745 

lie Arg Leu Thr 



Arg Asp Lys Val 



Ser Tyr Ser Gin 
720 

Pro Leu Ala Leu 
735 

Pro Phe Leu Leu 
750 

Gly Ala Pro Leu 
765 



<210> 4 

<211> 2315 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> Coding region from residue 1 to 1764 
<220> 

<221> 3'UTR 
<222> () . . (2315) 

<400> 4 

gggggtgtcc tctatgctgc cactgtgaaa aactacctgg ggacggagcc aattatcacc 60 

agagcagtgg gtcgtgccga ggactggatt cggacagata ccttgccttc ctggctgaac 120 

gccccagcct ttgtcgcagc cgtggccttg agcccagccg aatgggggga tgaagatgga 180 

gacgacgaaa tctacttctt ctttacggag acttcccgag catttgactc atacgagcgc 240 

attaaagtcc cacgggtggc ccgtgtgtgt gcgggggacc tcgggggccg gaagaccctc 300 

cagcagagat ggacgacgtt tttgaaagct gacctgctct gtccagggcc tgagcatggc 360 

cgggcctcca gtgtcctgca ggatgttgct gtgcttcgac ctgagcttgg ggcagggact 420 

cccatctttt atggcatctt ttcttcccag tgggaggggg ctactatctc tgctgtctgt 480 

gccttccgac cacaagacat tcggacagtg ctgaatggtc ccttcagaga actaaaacat 540 

gactgcaaca gaggactgcc tgtcgtggac aatgatgtgc cccagcccag acctggagag 600 

tgcatcacca acaacatgaa gctccggcac tttggctcat ctctctccct gcctgaccgc 660 

gtactcacct tcatccggga ccacccactc atggacaggc cagtgtttcc agctgatggc 720 

caccccctgc tggtcactac agatacagcc tatctcagag tcgtggccca cagggtgacc 780 

agcctctcag ggaaagagta tgatgtgctc tacctgggga cagaggatgg acacctccac 840 

cgagcagtgc ggatcggagc tcagctcagc gttcttgaag atctggcctt attcccagag 900 
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ccacagccag ttgagaacat gaaattgtac cacagctggc tcctggttgg ctcccgtact 960 
gaggtgacac aagtgaatac aaccaactgt ggccgtctcc agagctgctc agagtgcatc 1020 
ctggcccagg acccagtctg tgcctggagc ttccggctgg atgagtgtgt ggcccatgcc 1080 
ggggagcacc gagggttggt ccaagacata gagtcagcag atgtctcctc tttgtgtcct 1140 
aaagagcctg gagaacgtcc agtagtgttt gaagttcccg tggctacagc tgcgcatgtg 1200 
gtcttgccat gttctccaag ctcagcatgg gcatcctgtg tgtggcacca gcccagtgga 1260 
gtgactgcac tcaccccccg gcgggatgga ctggaggtgg tggtgacccc aggggccatg 1320 
ggcgcttatg cctgtgaatg tcaggagggt ggggcagccc atgtggtagc agcttacagc 1380 
ttggtatggg gcagccagcg agatgctccg agccgggccc acacagtggg ggcgggactg 1440 
gctggcttct tcttggggat tctcgcagca tccctgactc tcattctgat tggtcggcgt 1500 
cagcagcgac ggcgacagag ggaacttctg gctagagaca aggtgggcct ggacctgggg 1560 
gctccacctt ctgggaccac aagctacagc caagaccctc cctccccctc tcctgaagat 1620 
gagcggttgc cgctggccct ggccaagagg ggcagtggct ttggtggatt ctcaccaccc 1680 
ttcctgcttg atccttgccc aagcccagcc cacattcggc taactggggc tcctctagcc 1740 
acatgtgatg aaacatccat ctagagctgg gcaaatgacc actagtgtat aagtgatcac 1800 
tggaacggag tgaccactga gatgctgggg gtcactgggc ctggaagacc atcccagcct 1860 
ctgagttctc tttgagtatg agtgattact tggattttag tatctgttct ctctgagcct 1920 
ggatgggctt ggggccagac ctttgcctga ttcctgattc ccatgagaaa tcagaactgc 1980 
tttctgcagc aaatcagggc ttccccctaa catctgaact cctgtaaacc ttcatccctg 2040 
gccccctatc ttgggcccat tagttttggg gatggggcac agggcatagc tatgactttg 2100 
ctttctggtt ggagcctggc cggaaggaag agccctggag gtggttgggg gcaaatgtgc 2160 
cctgagtcct tggggtggtt ctgcttattc ttcaagttta tctgaatctg tggggagtgc 2220 
atgatcccca tgttgcaata tggagtctct gccctgagat cttccccatc tcagttttcc 2280 
ttccatgaaa gagtacgtgt aaatacatag tgttc 2315 

<210> 5 

<211> 1761 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> Coding region from residue 1 to 1761 
<400> 5 

gggggtgtcc tctatgctgc cactgtgaaa aactacctgg ggacggagcc aattatcacc 60 
agagcagtgg gtcgtgccga ggactggatt cggacagata ccttgccttc ctggctgaac 120 
gccccagcct ttgtcgcagc cgtggccttg agcccagccg aatgggggga tgaagatgga 180 
gacgacgaaa tctacttctt ctttacggag acttcccgag catttgactc atacgagcgc 240 
attaaagtcc cacgggtggc ccgtgtgtgt gcgggggacc tcgggggccg gaagaccctc 300 
cagcagagat ggacgacgtt tttgaaagct gacctgctct gtccagggcc tgagcatggc 360 
cgggcctcca gtgtcctgca ggatgttgct gtgcttcgac ctgagcttgg ggcagggact 420 
cccatctttt atggcatctt ttcttcccag tgggaggggg ctactatctc tgctgtctgt 480 
gccttccgac cacaagacat tcggacagtg ctgaatggtc ccttcagaga actaaaacat 540 
gactgcaaca gaggactgcc tgtcgtggac aatgatgtgc cccagcccag acctggagag 600 
tgcatcacca acaacatgaa gctccggcac tttggctcat ctctctccct gcctgaccgc 660 
gtactcacct tcatccggga ccacccactc atggacaggc cagtgtttcc agctgatggc 720 
caccccctgc tggtcactac agatacagcc tatctcagag tcgtggccca cagggtgacc 780 
agcctctcag ggaaagagta tgatgtgctc tacctgggga cagaggatgg acacctccac 840 
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cgagcagtgc ggatcggagc tcagctcagc gttcttgaag atctggcctt attcccagag 900 

ccacagccag ttgagaacat gaaattgtac cacagctggc tcctggttgg ctcccgtact 960 

gaggtgacac aagtgaatac aaccaactgt ggccgtctcc agagctgctc agagtgcatc 1020 

ctggcccagg acccagtct'g tgcctggagc ttccggctgg atgagtgtgt ggcccatgcc 1080 

ggggagcacc gagggttggt ccaagacata gagtcagcag atgtctcctc tttgtgtcct 1140 

aaagagcctg gagaacgtcc agtagtgttt gaagttcccg tggctacagc tgcgcatgtg 1200 

gtcttgccat gttctccaag ctcagcatgg gcatcctgtg tgtggcacca gcccagtgga 1260 

gtgactgcac tcaccccccg gcgggatgga ctggaggtgg tggtgacccc aggggccatg 1320 

ggcgcttatg cctgtgaatg tcaggagggt ggggcagccc atgtggtagc agcttacagc 1380 

ttggtatggg gcagccagcg agatgctccg agccgggccc acacagtggg ggcgggactg 1440 

gctggcttct tcttggggat tctcgcagca tccctgactc tcattctgat tggtcggcgt 1500 

cagcagcgac ggcgacagag ggaacttctg gctagagaca aggtgggcct ggacctgggg 1560 

gctccacctt ctgggaccac aagctacagc caagaccctc cctccccctc tcctgaagat 1620 

gagcggttgc cgctggccct ggccaagagg ggcagtggct ttggtggatt ctcaccaccc 1680 

ttcctgcttg atccttgccc aagcccagcc cacattcggc taactggggc tcctctagcc 1740 

acatgtgatg aaacatccat c 1761 

<210> 6 
<211> 587 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Gly Gly Val Leu Tyr Ala Ala Thr Val Lys Asn Tyr Leu Gly Thr Glu 
15 10 15 

Pro He He Thr Arg Ala Val Gly Arg Ala Glu Asp Trp He Arg Thr 
20 25 30 

Asp Thr Leu Pro Ser Trp Leu Asn Ala Pro Ala Phe Val Ala Ala Val 
35 40 45 

Ala Leu Ser Pro Ala Glu Trp Gly Asp Glu Asp Gly Asp Asp Glu He 
50 55 60 

Tyr Phe Phe Phe Thr Glu Thr Ser Arg Ala Phe Asp Ser Tyr Glu Arg 
65 70 75 80 

He Lys Val Pro Arg Val Ala Arg Val Cys Ala Gly Asp Leu Gly Gly 
85 90 95 

Arg Lys Thr Leu Gin Gin Arg Trp Thr Thr Phe Leu Lys Ala Asp Leu 
100 105 110 

Leu Cys Pro Gly Pro Glu His Gly Arg Ala Ser Ser Val Leu Gin Asp 
115 120 125 

Val Ala Val Leu Arg Pro Glu Leu Gly Ala Gly Thr Pro He Phe Tyr 
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130 



135 



140 



Gly He Phe Ser Ser Gin Trp Glu Gly Ala Thr He Ser Ala Val Cys 
145 150 155 160 

Ala Phe Arg Pro Gin Asp He Arg Thr Val Leu Asn Gly Pro Phe Arg 
165 170 175 

Glu Leu Lys His Asp Cys Asn Arg Gly Leu Pro Val Val Asp Asn Asp 
180 185 190 

Val Pro Gin Pro Arg Pro Gly Glu Cys He Thr Asn Asn Met Lys Leu 
195 200 205 

Arg His Phe Gly Ser Ser Leu Ser Leu Pro Asp Arg Val Leu Thr Phe 
210 . 215 220 

He Arg Asp His Pro Leu Met Asp Arg Pro Val Phe Pro Ala Asp Gly 
225 230 235 240 

His Pro Leu Leu Val Thr Thr Asp Thr Ala Tyr Leu Arg Val Val Ala 
245 250 255 

His Arg Val Thr Ser Leu Ser Gly Lys Glu Tyr Asp Val Leu Tyr Leu 
260 265 270 

Gly Thr Glu Asp Gly His Leu His Arg Ala Val Arg He Gly Ala Gin 
275 280 285 

Leu Ser Val Leu Glu Asp Leu Ala Leu Phe Pro Glu Pro Gin Pro Val 
290 295 300 

Glu Asn Met Lys Leu Tyr His Ser Trp Leu Leu Val Gly Ser Arg Thr 
305 310 315 320 

Glu Val Thr Gin Val Asn Thr Thr Asn Cys Gly Arg Leu Gin Ser Cys 
325 330 335 

Ser Glu Cys lie Leu Ala Gin Asp Pro Val Cys Ala Trp Ser Phe Arg 
340 345 350 

Leu Asp Glu Cys Val Ala His Ala Gly Glu His Arg Gly Leu Val Gin 
355 360 365 



Asp He Glu Ser Ala Asp Val Ser Ser Leu Cys Pro Lys Glu Pro Gly 
370 375 380 



Glu Arg Pro Val Val Phe Glu Val Pro Val Ala Thr Ala Ala His Val 
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385 

Val Leu Pro Cys 

Gin Pro Ser Gly 
420 

Val Val Val Thr 
435 

Glu Gly Gly Ala 
450 

Ser Gin Arg Asp 
465 

Ala Gly Phe Phe 



390 

Ser Pro Ser Ser 
405 

Val Thr Ala Leu 



Pro Gly Ala Met 
440 

Ala His Val Val 
455 

Ala Pro Ser Arg 
470 

Leu Gly lie Leu 
485 



395 

Ala Trp Ala Ser 
410 

Thr Pro Arg Arg 
425 

Gly Ala Tyr Ala 



Ala Ala Tyr Ser 
460 

Ala His Thr Val 
475 

Ala Ala Ser Leu 
490 



400 

Cys Val Trp His 
415 

Asp Gly Leu Glu 
430 

Cys Glu Cys Gin 
445 

Leu Val Trp Gly 



Gly Ala Gly Leu 
480 

Thr Leu lie Leu 
495 



lie Gly Arg Arg 
500 

Asp Lys Val Gly 
515 

Tyr Ser Gin Asp 
530 

Leu Ala Leu Ala 
545 

Phe Leu Leu Asp 



Ala Pro Leu Ala 
580 



Gin Gin Arg Arg 



Leu Asp Leu Gly 
520 

Pro Pro Ser Pro 
535 

Lys Arg Gly Ser 
550 

Pro Cys Pro Ser 
565 

Thr Cys Asp Glu 



Arg Gin Arg Glu 
505 

Ala Pro Pro Ser 



Ser Pro Glu Asp 
540 

Gly Phe Gly Gly 
555 

Pro Ala His lie 
570 

Thr Ser lie 
585 



Leu Leu Ala Arg 
510 

Gly Thr Thr Ser 
525- 

Glu Arg Leu Pro 



Phe Ser Pro Pro 
560 

Arg Leu Thr Gly 
575 



<210> 7 

<211> 196 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> Coding region from residue 1 to 196 
<400> 7 
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aaattgtacc acagctggct cctggttggc 
accaactgtg gccgtctcca gagctgctca 
gcctggagct tccggctgga tgagtgtgtg 
• caagacatag agtcag 



tcccgtactg aggtgacaca agtgaataca 60 
gagtgcatcc tggcccagga cccagtctgt 120 
gcccatgccg gggagcaccg agggttggtc 180 

196 



<210> 8 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Syntheitc DNA 



<400> 8 

gataaggatc cgggtcgccg tcagcagcgt 30 

<210> 9 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Syntheitc DNA 



<400> 9 

ggctggaatt cattttcccc ggcttta 27 

<210> 10 

<211> 333 

<212> DNA 

<213> Homo sapiens 



<220> 

<223> Coding region from residue 1 to 333 



<400> 10 

ccccggccgg gtcccgggca gcctacagcc tcgcccttcc cgctactgct gctggcggtg 60 

ctgagcggcc cggtatccgg ccgcgtcccc cgctcggtgc ccagaacctc gcttccaatc 120 

tctgaggctg acttctgtct cacccggttc gcagtccctc acacatacaa ttactctgtt 180 

ctccttgtgg atcctgcctc ccacacactt tatgttggcg cccgggacac catcttcgct 240 

ttatccctgc ccttctcagg ggagagaccc cgcaggattg actggatggt tcctgaggct 300 

cacagacaga actgtaggaa gaaaggcaag aaa 333 



<210> 11 
<211> 111 
<212> PRT 

<213> Homo sapiens 
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<400> 11 
Pro Arg Pro Gly 
1 

Leu Leu Ala Val 
20 

Val Pro Arg Thr 
35 

Arg Phe Ala Val 
50 

Pro Ala Ser His 
65 

Leu Ser Leu Pro 



Val Pro Glu Ala 
100 



Pro Gly Gin Pro 
5 

Leu Ser Gly Pro 



Ser Leu Pro lie 
40 

Pro His Thr Tyr 
55 

Thr Leu Tyr Val 
70 

Phe Ser Gly Glu 
85 

His Arg Gin Asn 



Thr Ala Ser Pro 
10 

Val Ser Gly Arg 
25 

Ser Glu Ala Asp 



Asn Tyr Ser Val 
60 

Gly Ala Arg Asp 
75 

Arg Pro Arg Arg 
90 

Cys Arg Lys Lys 
105 



Phe Pro Leu Leu 
15 

Val Pro Arg Ser 
30 

Phe Cys Leu Thr 
45 

Leu Leu Val Asp 



Thr He Phe Ala 
80 

He Asp Trp Met 
95 

Gly Lys Lys 
110 



<210> 12 
<211> 7 
<212> PRT 

<213> Homo sapiens 
<400> 12 

Gin Asp Pro Val Cys Ala Trp 
1 5 



<210> 13 
<211> 6 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> First Xaa = Gin or Arg 
<220> 

<223> Second Xaa = Ala or Gly 
<400> 13 

Xaa Asp Tyr Cys Xaa Trp 
1 5 
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% 



# 



<210> 14 
<211> 14 
<212> PRT 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Myc tag 
<400> 14 

Asp lie Gly Gly Glu Gin Lys Leu lie Ser Glu Glu Asp Leu 
15 10 
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